CDS

Accession Number TCMCG042C18529
gbkey CDS
Protein Id XP_016450665.1
Location join(8070..8235,9889..10067,10986..11228,11541..11652,16701..16871,18387..18450,18664..18721,18819..18996,22406..22536,22830..22932,23036..23112,23742..23789,23876..24025,24386..24559)
Gene LOC107775450
GeneID 107775450
Organism Nicotiana tabacum

Protein

Length 617aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA319578
db_source XM_016595179.1
Definition PREDICTED: sterol 3-beta-glucosyltransferase UGT80A2-like [Nicotiana tabacum]

EGGNOG-MAPPER Annotation

COG_category CG
Description Glycosyltransferase family 28 N-terminal domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01003        [VIEW IN KEGG]
KEGG_ko ko:K05841        [VIEW IN KEGG]
EC 2.4.1.173        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway -
GOs GO:0003674        [VIEW IN EMBL-EBI]
GO:0003824        [VIEW IN EMBL-EBI]
GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0008194        [VIEW IN EMBL-EBI]
GO:0016740        [VIEW IN EMBL-EBI]
GO:0016757        [VIEW IN EMBL-EBI]
GO:0043226        [VIEW IN EMBL-EBI]
GO:0043227        [VIEW IN EMBL-EBI]
GO:0043229        [VIEW IN EMBL-EBI]
GO:0043231        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGCGGATTCATTGGAAAAGAATAATAATGGGCTTGATCGACAGTTGAGTCCGTCGGGTGATTCGGGCGAGGTTCCGGTTGATTTCGAAGTGGAAATTGTACAGGGTGATAATGGGAATGAAATGCGCAAGGAGGCTGATGTTAATAGTATCAATATTGTTGAAGGATCTGTTTACGGGAATTCGGCTGGGCCTAGCACTACAAATGTGGAAAAGCCGGTGGAGAAATCAGAACTTGGTATCAGTCAACCGGTTAAAGCTGGAAGACGCAAACAAAACCAGAATCGGGCTCTAGGTCTGCTAGCAGCAAAGCTTTTTGATGATAAAGTTCCTTTAAGAAAAAAGCTCAAATTGTTCAATAGGCTTGCCACTGTTCAAGATGACGGCACTGTGCAATTTGAAGTTCCGGGGGATATTAAACCCGAAAAACTTGATTTTGGCACTGGAGTTGTTTACAATGGAGCTACAGTTGAAGCAGCGAATGATGTAGCCGATACACCAGAATTACCTCCATTGCAAATTGTTATGCTCATTGTCGGGACTAGGGGAGATGTGCAACCCTTTGTTGCCATTGGGAAAAAGTTTCAGGAAAGTGGTCATAGGGTGAGACTAGCAACTCATGCCAATTTTAGAGAGTTCGTCTTGAGTGCTGGATTGGAATTTTATCCTCTAGGTGGAGATCCAAAAGTTCTTGCTGCTTACATGGTAAAGAATAAAGGGTTTTTGCCATCTGGACCTTCTGAAATACATATTCAACGAAATCAAATAAAAGATATCGTATTCTCCTTGCTACCTGCATGCGTAGATCCTGATCCAGAGTCCAACGTTCCATTCAAAGTAAACGCCATTATTGCCAATCCTCCTGCATATGGACATATGCATGTAGCAGAGGCCCTGAAAGTACCATTGCATATATTTTTCACGATGCCATGGACGCCTACTAGTGAGTTTCCACATCCTCTCTCTCGTGTCAAACAGGCAGTTGGTTATAGACTATCATATCAAGTTGTTGATGGACTGATCTGGCTTGGGATTCGAGATGTGATAAATGACTTCAGGAAGAAAAAGTTGAAGCTAAGGCCAGTAACTTATTTGAGTAACTCCAACAGTTTCCATCCAGATGTGCCTTATGGGTATATATGGAGTCCGCACCTAGTTCCTAAACCCAAAGATTGGGGCCCAAAAATTGACGTGGTGGGCTTTTGCTTCCTAGACCTTGCTTCCAATTATGAACCCCCAGAAGAACTCGTTAAATGGCTTGAAGATGGTGAAAAGCCTATCTATATTGGCTTTGGAAGTCTTCCTGTTCAAGAACCTGAAAAAATGACCGAGATAATTGTTCAAGCTCTAGAAATGACTGGACAAAGAGGTATCATTAACAAAGGCTGGGGTGGCCTTGGGAACTTGAAGGAGCCAAAGGATTTTGTGTACCTGTTGGATAATTGCCCTCATGATTGGCTATTCCTGCAATGTGCTGCTGTGGTGCATCATGGAGGTGCTGGAACAACCGCTGCCGGACTTAAAGCTGCGTGCCCAACGACTGTGGTACCTTTCTTTGGGGATCAACCCTTTTGGGGGGAACGTGTGCATGCTAGGGGAGTTGGCCCTGCTCCCATCTCCGTGGATGAGTTCTCACTTGAAAAGCTGGTTGCTGCCATCCGCTTCATGCTAGATCCAAAGGTAAAAGAACGTGCTGTAGAACTAGCAAAAGCCATGGAGAATGAGGATGGAGTGACTGGAGCTGTGAAAGCATTCTATAAACATTTCCCTCGAGAATCACTTGAGCCTAAGCCCGAGATCTCACCTCATCCTCACCATTTCTTTTCCCTAAGACGCTGTTTTGGTCACACCTAA
Protein:  
MADSLEKNNNGLDRQLSPSGDSGEVPVDFEVEIVQGDNGNEMRKEADVNSINIVEGSVYGNSAGPSTTNVEKPVEKSELGISQPVKAGRRKQNQNRALGLLAAKLFDDKVPLRKKLKLFNRLATVQDDGTVQFEVPGDIKPEKLDFGTGVVYNGATVEAANDVADTPELPPLQIVMLIVGTRGDVQPFVAIGKKFQESGHRVRLATHANFREFVLSAGLEFYPLGGDPKVLAAYMVKNKGFLPSGPSEIHIQRNQIKDIVFSLLPACVDPDPESNVPFKVNAIIANPPAYGHMHVAEALKVPLHIFFTMPWTPTSEFPHPLSRVKQAVGYRLSYQVVDGLIWLGIRDVINDFRKKKLKLRPVTYLSNSNSFHPDVPYGYIWSPHLVPKPKDWGPKIDVVGFCFLDLASNYEPPEELVKWLEDGEKPIYIGFGSLPVQEPEKMTEIIVQALEMTGQRGIINKGWGGLGNLKEPKDFVYLLDNCPHDWLFLQCAAVVHHGGAGTTAAGLKAACPTTVVPFFGDQPFWGERVHARGVGPAPISVDEFSLEKLVAAIRFMLDPKVKERAVELAKAMENEDGVTGAVKAFYKHFPRESLEPKPEISPHPHHFFSLRRCFGHT